Ziqi Zhao

ROSD: Reflective On-Policy Self-Distillation for Language Model Reasoning across Domains

May 27, 2026
Via arXiv

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Apr 26, 2026
Via arXiv

Gym-V: A Unified Vision Environment System for Agentic Vision Research

Mar 17, 2026
Via arXiv

LoR-LUT: Learning Compact 3D Lookup Tables via Low-Rank Residuals

Feb 26, 2026
Via arXiv

Reinforced Efficient Reasoning via Semantically Diverse Exploration

Jan 08, 2026
Via arXiv

A Multi-scale Representation Learning Framework for Long-Term Time Series Forecasting

May 13, 2025
Via arXiv

Improving Sequential Recommenders through Counterfactual Augmentation of System Exposure

Apr 18, 2025
Via arXiv

HealthiVert-GAN: A Novel Framework of Pseudo-Healthy Vertebral Image Synthesis for Interpretable Compression Fracture Grading

Mar 08, 2025
Via arXiv

A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition

Feb 25, 2025
Via arXiv

Omni Differential Drive for Simultaneous Reconfiguration and Omnidirectional Mobility of Wheeled Robots

Dec 14, 2024
Via arXiv